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Abstract 

A study of the parameterization uncertainty at low Bjorken x < 0. 1 for the parton 
distribution functions of the proton is presented. The study is based on the HERA I 
combined data using a flexible parameterization form based on Chebyshev polynomi- 
als with and without an additional regularization constraint. The accuracy of the data 
allows to determine the gluon density in the kinematic range of 0.0005 < x < 0.05 
with a small parameterization uncertainty. An additional regularization prior leads to 
a significantly reduced uncertainty for x < 0.0005. 



1 Introduction 



The accurate knowledge of the parton distribution functions (PDFs) plays an important role for 
predictions of hard scattering cross sections at pp and pp colliders. The latter are computed in the 
perturbative approach including higher order radiative corrections, e.g. at next-to-leading order 
(NLO), which results in reduced theoretical uncertainties. Particular cross sections, such as Drell- 
Yan production of W, Z bosons at the LHC are even calculated to next-to-next-to-leading order 
(NNLO), see and exhibit a small theoretical uncertainty of ~ 2%. For these processes, the 

accuracy of the prediction is presently limited by the uncertainties of the PDFs. 

The PDFs being non-perturbative by definition can be determined from fits to data from DIS e- 
and v-scattering, and from Drell-Yan experiments. These fits are performed using the well-known 
QCD evolution equations at NLO and NNLO j3]-[8l. The data are provided at discrete values of 
Bjorken x and absolute four momentum transfer squared Q 2 with their statistical and systematic 
uncertainties. With this given input, the uncertainty of the PDFs due to experimental errors are 
estimated using Hessian Il9iri0ll and Monte Carlo ifTTI methods. Additional theoretical uncertainties 
arise, e.g. from unknown higher orders in the evolution or the treatment and scheme choice for 
heavy flavor contributions [fT2llT3l . These need to be considered separately (see e.g. |fl4l[T5lO . 

PDF fits require an ansatz for the parameterization by a certain function of x at the starting scale 
Qq of the evolution. Fitting of experimental data at discrete points with an, in general, arbitrary 
function is an ill-posed problem which requires regularization. Typically Regge-theory inspired 
parameterizations are used with a small number of parameters which implicitly contain smooth 
and regular behavior requirements for the PDFs. For these parameterizations, it is difficult to es- 
timate the PDF uncertainty arising from the choice of a particular ansatz. Alternatively, flexible 
parameterizations based on a neural net approach were used recently ifToll . The number of param- 
eters in this approach is determined by the data using an over-fitting protection technique which is 
an implicit regularization. 

In this note, a new study of the parameterization uncertainty at low Bjorken x < 0. 1 is per- 
formed. An explicit regularization prior is introduced which disfavors resonant-like behavior of 
PDFs at low x and the impact of the prior on the parameterization uncertainty is evaluated with 
particular emphasis on the gluon density in the range 0.0005 < x < 0.05. We choose a flexible 
ansatz for the PDFs at low x using Chebyshev polynomials. The analysis is based on the combined 
HERA I data El. 

2 QCD Analysis Settings 

The QCD analysis presented here is performed using as a sole input the combined HI and ZEUS 
data on neutral and charged current e ± p scattering double-differential cross sections collected 
during the HERA I run period of 1994-2000 IfTTTl . The kinematic range of the data extends from 
0.045 < Q 2 < 30000 GeV 2 and 0.000006 < x < 0.65, however in the QCD fit analysis only data 
with Q 2 > <2m in = 3.5 GeV 2 are considered in order to minimize the non-perturbative higher twist 
effects. 

The QCD fit is performed within the framework of the QCDNUM program implemented at 
NLO in QCD lfT8ll and using a Zero-Mass-Variable-Flavor scheme. The fit minimizes a % 2 function 
as specified in IfTTll . The PDFs are parameterized at the starting scale of Qq = 1.9 GeV 2 . We use a 
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flavor decomposition similar to lfT9l as follows: xd va u xu va i, xA = xu—xd, xS = 2{xu + xd + xs + 
xc + xb) where c and b quark densities are zero at the scales below their corresponding thresholds. 
The PDFs are evolved in Q 2 using the NLO equations in the massless MS-scheme and the charm 
and beauty quark PDFs are generated by evolution for scales above the respective thresholds. The 
renormalization and factorization scales are set to Q 2 . 

Since this study is focused on the low x < 0.1 region, the set-up for the QCD analysis is 
special if somewhat simplified compared to modern high precision determinations of PDFs, see 
e-g- lfT7ll20l - l2"2l w At low x the PDFs are dominated by the gluon and sea-quark densities while at 
high x the valence-quark densities give larger contribution. Thus, regarding the functional form for 
the PDFs, standard Regge-theory inspired parameterizations are used for the valence quarks: 

xu v {x) = A Ur x? u *(l -x) c "\ (1) 
xd v (x) = A*/* (1 (2) 

The low-x behavior of the valence densities is assumed to be the same for u and d quarks by 
setting B Uv = 5^, . The normalizations A Uv and A^ v are determined by the fermion number sum 
rules. Therefore the valence sector is described by three parameters. 

For the gluon and sea densities a flexible Chebyshev polynomials based parameterization is 
used. The polynomials use logx as an argument to emphasize the low x behavior. The parameteri- 
zation is valid for x > x m i n = 1.7 x 1(T 5 which covers the x range of the HERA measurements for 
Q 2 > 2min- The PDFs are multiplied by (1 —x) to ensure that they vanish as x — > 1. The resulting 
parameterization form is 

f\ a ft \ X* a t( 2l0gJC-l0gJC min \ 

xg(x) = A g (l -x) L A Si T i r— , (3) 

;= V iogx min j 



vf \ ft \W t( 21ogx-logjc min \ 
xS(x) = (1 -x) ^ A Si Ti — — 



(4) 

!=0 

where Tj denote Chebyshev polynomials of the first type and the sum over i runs up to N g s =15 
for the gluon and sea-quark densities. The Chebyshev polynomials are given by the well-known 
recurrence relation: 

To(x) = 1, (5) 
7i(x) = x, (6) 
T n +i(x) = 2xT n (x)—T n -i(x). (7) 

The normalization A g is determined by the momentum sum rule. The advantage of the parameter- 
ization given by Eqs. ©, © is that momentum sum rule can be evaluated analytically. Moreover, 
already for N g> s > 5 the fit quality is similar to that of a standard Regge-inspired parameterization 
with a similar number of parameters. 

The PDF uncertainties are estimated using the Monte Carlo technique [fTTTl . The method con- 
sists in preparing replicas of data sets by allowing the central values of the cross sections to fluc- 
tuate within their systematic and statistical uncertainties taking into account all point-to-point cor- 
relations. The preparation of the data is repeated for /V > 100 times and for each of these replicas 
a complete NLO QCD fit is performed to extract the PDF set. The PDF central values and un- 
certainties are estimated using the mean values and root-mean-squared (RMS) over the PDF sets 
obtained for each replica. 
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3 Choice of the Smoothness Constraint 



Fitting an arbitrary function to a discrete number of measurements is an ill-posed problem which 
requires regularization. This regularization should have a physical motivation and be flexible 
enough to cover the space of solutions compatible with QCD. At low x, the sea-quark PDF closely 
corresponds to a measurement of the structure function F2 in a DIS process. For DIS at low x, the 
invariant mass of the hadronic final state W is calculated from Q 2 and x as 



W = \/Q 2 ^. (8) 

Experimentally, it is well-known that for low values of W and Q 2 the structure function F2 
displays resonances [23]. These resonances, however, disappear for high W > 5 GeV. The smooth 
behavior of F2 for high W can be explained phenomenologically by high particle multiplicity of the 
hadronic final state. A prior which disfavors resonant structures in W, for W exceeding a certain 
value Wmin, has therefore a strong phenomenological motivation. This prior can be introduced as 
an additional penalty to the likelihood function for the PDFs which are longer in W. Note that a 
prior using the length in W as opposed to the length in x enhances sensitivity to the low x region. 
For the % 2 function the prior corresponds to an extra penalty term of a form 
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Xprior ^ 



W'n- 



1 + (xf'(W)) 2 dW - (WW - W min ) 



(9) 



where a is the relative weight of this PDF- length prior and the PDF xf = xg.xS, respectively. The 
prior Xprior nas a minimum for the shortest PDF in W which corresponds to a condition for the 
derivative, xf'{W) = 0. In this case, Xprior = holds irrespective of the value of a. The total % 2 0t is 
given by the sum of the % 2 , for the data versus theory comparison, and the penalty term 

Xtot = % +Xprior- (10) 

We choose W max = 320 GeV which is the maximum value achievable at HERA. To stay far 
away from the resonance region, W m m = 10 GeV is used which for Q 2 = 1.9 GeV 2 corresponds 
to x ~ 0.02. The prior is applied to both gluon and sea-quark densities at the starting scale Qq = 
1.9 GeV 2 of the evolution. 



4 Results 

The Monte Carlo procedure of extracting PDFs is illustrated for N par = N g = Ns = 9 in Fig. Q] 
which shows the gluon PDF at the starting scale Qq = 1.9 GeV 2 for each replica, with their RMS 
band. The distributions are compared to those obtained using the standard parameterization form: 
xg(x) = A g x B s(l —x) c «, xS(x) = A s jc B *(l — x) Cs , where A g is determined by the momentum sum 
rule. The flexible parameterization does not suppress minima and maxima of the distribution as a 
function of x, as a result, several of them are observed. 

Introducing the length prior to the fit by changing the weight of the penalty term from GeV -1 
to 1000 GeV -1 increases the % 2 of the fit by 66 units, see Tabled! Further increase of the penalty 
term to a = 5000 GeV -1 reduces fit quality considerably with an additional increase of % 2 by 141 
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Cheb=9, Q 2 = 1 .9. GeV 2 Cheb=9, Q 2 = 1 .9. GeV 2 




Figure 1: The gluon PDF xg(x) at the starting scale <2q = 1.9 GeV 2 . The green lines show fits to individ- 
ual replicas of the data, the red lines show the RMS over the replicas. The black lines correspond to the 
error band of the gluon distribution using a standard parameterization and it is to be compared to the case 
of the Chebyshev parameterization. On the left hand side, the gluon distribution is shown using an uncon- 
strained Chebyshev expansion to order nine, see Eq. ©, while on the right hand side the same distribution 
is displayed but with a tight length penalty a = 5000 GeV -1 applied. 




Figure 2: The central value of the gluon PDF for various values of the length-prior weight a at the evolution 
starting scale Q 2 = 1.9 GeV 2 using the Chebyshev parameterization expanded to the 15?/i order. The vertical 
dashed line shows the x value corresponding to the W m [ B = 10 GeV limit. 
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Table 1 : The quality of the fit in terms of % 2 



/«df as a function of the length-prior weight a. 



a, GeV" 1 
% 2 /"df 



10 100 1000 5000 

560/557 561/557 572/557 626/557 767/557 



units. For low values of a < 100, the impact of the penalty term on the shape of the central value 
of the gluon PDF is small while for a > 1000 the distribution changes significantly, see Fig.[2l In 
addition, the shape of the gluon distribution using a standard parameterization can be reproduced 
by the Chebyshev parameterization of the gluon PDF if a tight length prior is applied to the fit, as 
demonstrated in FigJT] 

Fig. \T\ shows that the PDF uncertainty is very large for x < 0.0002 and x > 0.05 while for 
intermediate values of x the data constrains the gluon PDF well. To quantify the dependence of the 
uncertainty as a function of N gj s, the RMS values at fixed values of x=0.0002, 0.001, 0.01 and 0.1 
are examined. The results of this investigation are summarized in Fig. |3] For x=0.0l and 0.001, 
the uncertainty stays approximately constant as the number of parameters increases while for x = 
0.0002 and x = 0. 1 it increases significantly, if no penalty term is used. The impact of the penalty 
term on the uncertainty of xg(x) is large at low x. Even low values of a < 100 GeV -1 significantly 
reduce the uncertainty for x = 0.0002 at large values of N par , while for x = 0.1 the penalty term 
has no impact. For the intermediate x values, the change of the uncertainty is moderate, indicating 
that for p < 100 GeV 1 the data provides stronger constraints on the gluon PDF than the prior. 

5 Summary 

The focus of this study has been on the parameterization uncertainty of PDFs at low x, especially 
of the gluon PDF in a fit to the HERA I data at NLO in QCD. A flexible PDF parameterization 
based on Chebyshev polynomials has been chosen and the impact of an additional smoothness 
prior on the quality of the has been investigated. We have found that the uncertainty of the fit is 
generally small in the 0.0005 < x < 0.05 range. The uncertainty, however, increases significantly 
for larger and smaller x values. The regularization with a smoothness prior, which disfavors reso- 
nant structures for large values of W allows to significantly reduce uncertainty also for the range 
x < 0.0005. 
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Figure 3: The uncertainty of the gluon PDF as a function of N par =N g = Ns for Q 2 = 1.9 GeV 2 at fixed 
values of x for different values of the length-prior weight a as indicated by the figure's legend. 
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